Automatic Acquisition of Annotated Training Corpora for Test-Code Generation
نویسندگان
چکیده
منابع مشابه
Automatic generation of visual scenarios for spoken corpora acquisition
The paper describes a system, in JAVA, for written and visual scenario generation to collect speech corpora in the framework of a Tourism Information System. Methods and experimental results are also presented for evaluating the degree of understanding of the proposed scenarios. The corpus generated from visual scenarios appears to be much richer than the one generated from textual descriptions.
متن کاملAutomatic Error Detection in Annotated Corpora
Annotated corpus is a linguistic resource which explicitly encodes the information at syntactic and semantic levels for each sentence. Annotated corpora play a crucial role in many applications of natural language processing (NLP). Error free and consistent annotated corpora is vital for these applications. Creating annotated corpora is an expensive and time consuming process. Errors or anomali...
متن کاملTraining Dependency Parsers from Partially Annotated Corpora
We introduce a maximum spanning tree (MST) dependency parser that can be trained from partially annotated corpora, allowing for effective use of available linguistic resources and reduction of the costs of preparing new training data. This is especially important for domain adaptation in a real-world situation. We use a pointwise approach where each edge in the dependency tree for a sentence is...
متن کاملRAVEL: An Annotated Corpora for Training Robots with Audiovisual Abilities
We introduce a publicly available data set which covers examples of Human Robot Interaction (HRI) scenarios. These scenarios are recorded using the audiovisual robot head POPEYE, equipped with two cameras and four microphones. All the recordings were performed in a standard meeting room enclosing all the challenges of natural indoor scenes. This data set provides a basis to test and benchmark m...
متن کاملAutomatic Acquisition of Sense Tagged Corpora
An important problem in Natural Language Processing is identifying thecorrect sense of a word in a particular context. Thus far, statistical methods have been considered the best techniques in word sense disambiguation. Unfortunately, these methods produce high accuracy results only for a small number of preselected words. The reduced applicability of statistical methods is due basically to the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information
سال: 2019
ISSN: 2078-2489
DOI: 10.3390/info10020066